area chair
The authors would like to thank all the three reviewers for their useful feedback and the area chair for handling this
To address the reviewers' comments, upon acceptance of this paper, we will (i) include numerical experiment Some common concerns are as follows. Details of this experiment will be found in final version. Reviewer 1: We thank the reviewer for providing constructive and supportive comments. They will be corrected in the final version. Details will be provided in the final version.
propagation on the DAVIS dataset (Table 1), in comparison to a SOT A3 self-supervised method [49] and the ImageNet pre-trained representation
Model J (Mean) Self-supervised, SOT A [49] 43.0 ImageNet Representation 49.4 Self-supervised, Ours 57.7 The shared affinity matrix bridges these tasks, and facilitates iterative improvements. These contributions are significant in the field of self-supervised learning. The contributions of this work are also demonstrated by our ablation study, i.e., Table 2 in the paper. We note that these components are novel and have not been explored in prior work. In the following, we address the other comments by reviewers.
Review for NeurIPS paper: UnModNet: Learning to Unwrap a Modulo Image for High Dynamic Range Imaging
The submission has received two positive and two negative reviews. The post-rebuttal discussion has not lead to convergence, and the opinion of the reviewers remain split. The concerns of the "negative" reviewers are: 1) The application is too niche (R1). However, the topic of the paper falls into NeurIPS call for papers, as it is related to low-level computer vision, compressed sensing, deep neural architectures. The authors rebut that the results in [55] were cherry-picked and that they use the code from [55], while fixing the parameters.
AI-Driven Review Systems: Evaluating LLMs in Scalable and Bias-Aware Academic Reviews
Tyser, Keith, Segev, Ben, Longhitano, Gaston, Zhang, Xin-Yu, Meeks, Zachary, Lee, Jason, Garg, Uday, Belsten, Nicholas, Shporer, Avi, Udell, Madeleine, Te'eni, Dov, Drori, Iddo
Automatic reviewing helps handle a large volume of papers, provides early feedback and quality control, reduces bias, and allows the analysis of trends. We evaluate the alignment of automatic paper reviews with human reviews using an arena of human preferences by pairwise comparisons. Gathering human preference may be time-consuming; therefore, we also use an LLM to automatically evaluate reviews to increase sample efficiency while reducing bias. In addition to evaluating human and LLM preferences among LLM reviews, we fine-tune an LLM to predict human preferences, predicting which reviews humans will prefer in a head-to-head battle between LLMs. We artificially introduce errors into papers and analyze the LLM's responses to identify limitations, use adaptive review questions, meta prompting, role-playing, integrate visual and textual analysis, use venue-specific reviewing materials, and predict human preferences, improving upon the limitations of the traditional review processes. We make the reviews of publicly available arXiv and open-access Nature journal papers available online, along with a free service which helps authors review and revise their research papers and improve their quality. This work develops proof-of-concept LLM reviewing systems that quickly deliver consistent, high-quality reviews and evaluate their quality. We mitigate the risks of misuse, inflated review scores, overconfident ratings, and skewed score distributions by augmenting the LLM with multiple documents, including the review form, reviewer guide, code of ethics and conduct, area chair guidelines, and previous year statistics, by finding which errors and shortcomings of the paper may be detected by automated reviews, and evaluating pairwise reviewer preferences. This work identifies and addresses the limitations of using LLMs as reviewers and evaluators and enhances the quality of the reviewing process.
GLIMPSE: Pragmatically Informative Multi-Document Summarization for Scholarly Reviews
Darrin, Maxime, Arous, Ines, Piantanida, Pablo, Cheung, Jackie CK
Scientific peer review is essential for the quality of academic publications. However, the increasing number of paper submissions to conferences has strained the reviewing process. This surge poses a burden on area chairs who have to carefully read an ever-growing volume of reviews and discern each reviewer's main arguments as part of their decision process. In this paper, we introduce \sys, a summarization method designed to offer a concise yet comprehensive overview of scholarly reviews. Unlike traditional consensus-based methods, \sys extracts both common and unique opinions from the reviews. We introduce novel uniqueness scores based on the Rational Speech Act framework to identify relevant sentences in the reviews. Our method aims to provide a pragmatic glimpse into all reviews, offering a balanced perspective on their opinions. Our experimental results with both automatic metrics and human evaluation show that \sys generates more discriminative summaries than baseline methods in terms of human evaluation while achieving comparable performance with these methods in terms of automatic metrics.
Evaluating the "Learning on Graphs" Conference Experience
Rieck, Bastian, Coupette, Corinna
With machine learning conferences growing ever larger, and reviewing processes becoming increasingly elaborate, more data-driven insights into their workings are required. In this report, we present the results of a survey accompanying the first "Learning on Graphs" (LoG) Conference. The survey was directed to evaluate the submission and review process from different perspectives, including authors, reviewers, and area chairs alike. The first "Learning on Graphs" (LoG) Conference (9-12 December, 2022) was remarkable in more ways than one: starting from scratch, the conference aims to be the place for graph learning research, making use of an advisory committee that consists of international experts in the field. Moreover, at is core, LoG wants to be known for its exceptional review quality.
#AAAI2023 workshops round-up 1: AI for credible elections, and responsible human-centric AI
The AAAI 2023 Workshop on Representation Learning for Responsible Human-Centric AI (R2HCAI) brought together researchers who are broadly interested in representation learning for responsible human-centric AI. The goal of the workshop was to facilitate the development and adoption of AI systems that can enhance, augment, and improve the quality of human life. We had six inspiring invited talks from renowned researchers that covered a wide range of research in the field of responsible human-centric AI. Marzyeh Ghassemi gave a talk on designing machine learning processes for equitable health systems, while Daniel Ruckert shared their recent work on human-centered AI for medical imaging. Kathy Meier-Hellstern shared a framework for responsible AI for large models, and Jacob Andreas presented their research towards natural language supervision.
What we learned from NeurIPS 2020 reviewing process
Now that the reviewing period is over, we would like to share with you some statistics and insights about the reviewing process we used this year. We received 12115 abstract submissions, which resulted in 9467 full paper submissions. Compared to 2019, the number of submissions increased by 40%, which is very similar to the growth from 2018 to 2019. After more than three months of hard work from our reviewers, area chairs and senior area chairs (thank you, all!!), we have accepted exactly 1900 papers, including 105 oral presentations and 280 spotlight presentations. Note that this year we introduced "Social Aspects of Machine Learning", with topics like fairness and privacy.
Experiments with the ICML 2020 peer-review process
The International Conference on Machine Learning (ICML) is a flagship machine learning conference that in 2020 received 4,990 submissions and managed a pool of 3,931 reviewers and area chairs. Given that the stakes in the review process are high -- the careers of researchers are often significantly affected by the publications in top venues -- we decided to scrutinize several components of the peer-review process in a series of experiments. Specifically, in conjunction with the ICML 2020 conference, we performed three experiments that target: resubmission policies, management of reviewer discussions, and reviewer recruiting. In this post, we summarize the results of these studies. Several leading ML and AI conferences have recently started requiring authors to declare previous submission history of their papers.
Avoiding a Tragedy of the Commons in the Peer Review Process
Sculley, D, Snoek, Jasper, Wiltschko, Alex
Peer review is the foundation of scientific publication, and the task of reviewing has long been seen as a cornerstone of professional service. However, the massive growth in the field of machine learning has put this community benefit under stress, threatening both the sustainability of an effective review process and the overall progress of the field. In this position paper, we argue that a tragedy of the commons outcome may be avoided by emphasizing the professional aspects of this service. In particular, we propose a rubric to hold reviewers to an objective standard for review quality. In turn, we also propose that reviewers be given appropriate incentive. As one possible such incentive, we explore the idea of financial compensation on a per-review basis. We suggest reasonable funding models and thoughts on long term effects.